Languages cool as they expand: Allometric scaling and the decreasing need for new words
Authors
Abstract
We analyze the occurrence frequencies of over 15 million words recorded in millions of books published during the past two centuries in seven different languages. For all languages and chronological subsets of the data we confirm that two scaling regimes characterize the word frequency distributions, with only the more common words obeying the classic Zipf law. Using corpora of unprecedented size, we test the allometric scaling relation between the corpus size and the vocabulary size of growing languages to demonstrate a decreasing marginal need for new words, a feature that is likely related to the underlying correlations between words. We calculate the annual growth fluctuations of word use, which show a decreasing trend as the corpus size increases, indicating a slowdown in linguistic evolution following language expansion. This "cooling pattern" forms the basis of a third statistical regularity, which, unlike the Zipf and Heaps laws, is dynamical in nature.
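The allometric relation mentioned in the abstract can be illustrated with a small sketch. It assumes the standard Heaps-law form V = a * N^b, where N is the corpus size and V is the vocabulary size; the abstract itself does not state the formula or any fitted values. The function names and the synthetic data (a = 5, b = 0.6) are hypothetical and chosen only for illustration. An exponent b < 1 means that each additional word of text contributes fewer new vocabulary items, which is one way to read the "decreasing marginal need for new words".

```python
import math

def fit_power_law(xs, ys):
    """Least-squares fit of log(y) = log(a) + b*log(x); returns (a, b)."""
    lx = [math.log(x) for x in xs]
    ly = [math.log(y) for y in ys]
    n = len(lx)
    mean_x = sum(lx) / n
    mean_y = sum(ly) / n
    sxx = sum((u - mean_x) ** 2 for u in lx)
    sxy = sum((u - mean_x) * (v - mean_y) for u, v in zip(lx, ly))
    b = sxy / sxx                      # slope in log-log space = scaling exponent
    a = math.exp(mean_y - b * mean_x)  # intercept in log-log space = log(prefactor)
    return a, b

def marginal_new_words(a, b, n):
    """Derivative dV/dN = a*b*N^(b-1): new vocabulary per additional word of text."""
    return a * b * n ** (b - 1)

# Synthetic, illustrative data only (NOT taken from the paper):
# corpus sizes from 10^4 to 10^9 words, vocabulary following V = 5 * N^0.6.
corpus_sizes = [10 ** k for k in range(4, 10)]
vocab_sizes = [5.0 * n ** 0.6 for n in corpus_sizes]

a_hat, b_hat = fit_power_law(corpus_sizes, vocab_sizes)
marginals = [marginal_new_words(a_hat, b_hat, n) for n in corpus_sizes]
```

With real data, `corpus_sizes` and `vocab_sizes` would be replaced by measured yearly totals. A fitted exponent below 1 then makes `marginals` a decreasing sequence.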
Similar resources
Comparative Study of the Academic Vocabulary Content of Electronic Engineering Corpora, GE Materials and M.S. Entrance Examinations
The importance of vocabulary learning has been underlined in the field of English for Academic Purposes (EAP) because non-English majors who require reading English texts in their fields of study have to expand their English vocabulary knowledge much more efficiently than ordinary ESL/EFL learners. Since academic vocabulary instruction in Iranian universities is realized through the use of Gene...
Rhyming Compounds as Elements of a Language Game (In Russian and English Languages)
The article is devoted to the study of composite rhyming compounds as a means of word formation games. It explores the place of this category of words in the lexical system and peculiarities of their use in the Russian and English languages. Authors of the article represent compound words as a special lexical subgroup. On the specific publicistic material are revealed the peculiarities of compo...
A NEW HYBRID GENETIC AND SWARM OPTIMIZATION FOR EARTHQUAKE ACCELEROGRAM SCALING
Earthquake time history records are required to perform dynamic nonlinear analyses. In order to provide a suitable set of such records, they are scaled to match a target spectrum as introduced in the well-known design codes. Corresponding scaling factors are taken similar in practice; however, optimizing them reduces extraordinary economic charge for the seismic design. In the present work a ne...
Vocabulary Lists for EAP and Conversation Students
Despite the abundance of research investigating general and academic vocabularies and developing dozens of word lists, few studies have compared academic vocabulary with general service word lists such as conversation vocabulary. Many EAP researchers assume that university students need to know all the words in West’s (1953) General Service List (GSL) as a prerequisite to academic words (e.g., ...
Form and function in juvenile ascidians. II. Ontogenetic scaling of volumetric flow rates
Very little is known of the challenges to suspension feeding performance facing early juvenile marine invertebrates, although scaling considerations suggest juveniles are often at a disadvantage. For example, early juvenile ascidians have relatively, as well as absolutely, narrower siphons than later stages, generating high resistance to flow (Sherrard & LaBarbera 2005: Mar Ecol Prog Ser 287:12...
Journal title:
Volume 2, Issue
Pages -
Publication date: 2012